ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS-N. 26-2009 (219-228) 



219 



OPTIMUM STATISTICAL TEST PROCEDURE 



Rajesh Singh 

Department of Statistics 
BHU, Varanasi (U.P.) 

India 

e-mail: rsinghstat@yahoo.com 

Jayant Singh 

Department of Statistics 
Rajasthan University 
Jaipur 
India 

e-mail: Jayantsinghf 7@rediffmail. com 

Florentin Smarandache 

Chair of Department of Mathematics 

University of New Mexico 

Gallup 

USA 

e-mail: smarand@unm.edu 



Abstract. In this paper we search for the optimum tests that minimize the sum of two 
error probabilities. 

Keywords and Phrases: Testing hypotheses, Neyman-Pearson lemma, optimality, 
normal distribution, chi-square distribution with n degrees of freedom, exponential dis- 
tribution, optimum test, critical region, variance. 

AMS Subject Classification: 62F03. 



1. Introduction 

Let X be a random variable having probability distribution P(X /6 ), 6 G 0. It 
is desired to test H 0 : d G 0 against Hi : d G ©i = 0 — O 0 . Let S denote 
the sample space of outcomes of an experiment and x— (xi,x 2 , ■■■,x n ) denote an 
arbitrary element of S. A test procedure consists in diving the sample space into 
two regions W and S — W and deciding to reject H 0 if the observed x G W. The 
region W is called the critical region. The function 7 (d) = Pg(x G W) = Pg(W), 
say, is called the power function of the test. 

We consider first the case where ©o consists of a single element, d 0 and its 
complement ©i also has a single element 61 . We want to test the simple hypothesis 
Hq : 6 = 9q against the simple alternative hypothesis Hi : d — d\. 
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Let L 0 = L(X/H 0 ) and L\ = L(X/Hi) be the likelihood functions under H 0 
and H\ respectively. In the Neyman-Pearson set up the problem is to determine 
a critical region W such that 



(1) 


7(0o) 


= P*,(W) = 


/ Lodx = a, an assigned value, and 








Jw 


(2) 


7(0i) 


= P*>(W) = 


/ Lida: is maximum 

Jw 



compared to all other critical regions satisfying (1). If such a critical region exists 
it is called the most powerful critical region of level a. 

By the Neyman-Pearson lemma, the most powerful critical region Wo for tes- 
ting Ho : 6 = 9q against Hi : 6 = 6\ is given by 

Wo = {x : Li > kL 0 } 

provided there exists a k such that ( 1 ) is satisfied. 

For this test y(do) = ot and y(0 1 ) — ► 1 as n — » a. 

But for any good test we must have 7 (^ 0 ) ~ ^ 0 and 7 (^ 1 ) — » 1 as n — » 00 
because complete discrimination between the hypotheses H 0 and Hi should be 
possible as the sample size becomes indefinitely large. 

Thus, for a good test it is required that the two error probabilities a and f3 
should depend on the sample size n and both should tend to zero as n — > 00 . 

We describe below test procedures which are optimum in the sense that they 
minimize the sum of the two error probabilities as compared to any other test. 
Note that minimizing (a + (3 ) is equivalent to maximising 

1 — (a + (3) = (1 — /3) — a = Power-Size. 

Thus, an optimum test maximises the difference of power and size as compared 
to any other test. 



Definition 1 . A critical region Wo will be called optimum, if 



( 3 ) 




— / L 0 dx > 
Jw 0 





Lodx , 



for every other critical region W . 

Lemma 1. For testing H 0 : 9 = 9 0 against Hi : 0 = 9i the region 

W 0 = {x \ Li> L 0 } is optimum. 



Proof. Wo is such that inside W 0 , L 1 > L 0 and outside W 0 , L 1 < L 0 . Let W be 
any other critical region 



(Li — L 0 )dx — / ( Li — Lo)dx = 



(L 1 — L 0 )dx — 



(Li - L 0 )dx, 



'Wo 



'w 



'w 0 nw c 



'wnw, 



0 



by subtracting the integrals over the common region Wo D W 



> 0 
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since the integrand of first integral is positive and the integrand under second 
integral is negative. Hence (3) is satisfied and Wq is an optimum critical region. 



Example 1. Consider a normal population N(6,a 2 ), where cr 2 is known. It is 
desired to test H 0 : 6 = 6 0 against Hi : 9 = 9\, 9\ > d 0 . 



L(x/9) 




v- O, '>)' 

Ac 2<r 2 

o z=l 



U 

L 0 



EK-eu 2 
e 2 a 2 



E^i-9 0 ) 2 

e 2^2 



The optimum test rejects H 0 if 



u 

L 0 



> 1 , 



i.e., if log y— > 0, 



i.e., if 



ZX^i-0 1) 2 , E0u-#o) 



2a 2 



+ 



2a 2 



> 0 , 



i.e., if 29i E x i~ n9f - 2 'p 0 E U + n @l > 0, 
i.e., if (26>i - 2d 0 ) E^i > rc(d? - d^), 

•f E ^ $1 + #0 

i.e., if > — - — , 

n 2 

i.e., if x > 



Thus the optimum test rejects H 0 if x > ^ 



We have a = Ph 0 
Under H 0 , X ~ °° 



_ . 9i + d 0 
a; > — - — 



= Pi 



Ho 



x-9p > y/n(9 1 - d 0 ) 
<j ! \fn ~~ 2a 






follows N( 0, 1) distribution. 



a = 1 _^(M£ L zM 

\ 2(7 



where <f> is the c.d.f. of an N(0, 1) distribution 



P = Pi 



H i 



_ di+do 
x < — - — 



= Pi 



H i 



X - 9 1 ^ y/n{6i - do)’ 

= < 2a 



cr Un 
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Under H x , 



_ follows N(Q, 1) distribution 






Here a = (3. It can be seen that a — (3 — > 0 as n — > oo. 

Example 2. For testing H 0 : 6 = 9q against H 0 : 6 = 9i,9i < 9 0 , the optimum 
test rejects H 0 when x < 1 ^ 0 • 

Example 3. Consider a normal distribution N{9,a 2 ), 6 known. It is desired to 
test H 0 : a 2 = Ug against Hi : a 2 = af, a 2 > o\. We have 



L(x/a 2 ) = 



1 \ 2 — 



27T<J 2 J 



u 

Lo 



i) ^ 

E Gi-e ) 2 



1 

log — 



Ulog.?-log^)- E(l -- (,)2 ■ U *'~ 6)2 



2 o-f 



+ 



2°o 






2^2 



at o', 



i u o 



The optimum test rejects H 0 if — > 1 

Lo 



Le -> if E7E2 ~JT ~ 0 ) 2 - T ( logC7 i “ log 



!.(>., if 



2o"|0'g ~ 2 

6») 2 no-? 



a n 



an 



> 2 2 

af -^6 



(log a! — log a 0 ) 



i.e., if X) 



(xi-6 



V 0o 



> nc, 



where c = ^ ^ (log a 2 — logag). 



af — a 0 



Thus the optimum test rejects H 0 if 




2 



> nc. 




OPTIMUM STATISTICAL TEST PROCEDURE 



223 



Note that under H 0 : — follows iV(0, 1) distribution. Hence ^ ( Xl 



H 0 



V 



follows under H 0 , a chi-square distribution with n degrees of freedom (d.f.). Here 



a = Pi 



H 0 



£ 



Xj - Op 

Ho 



> nc 



= P [*(„) > nc 



and 



1-/3 = Pi 



Hi 



E 



(xi-9 



V o-o 



> nc 



= P 



Hi 



Xj-6 \ ncal 
hq ) ~ n\ 



= P 



Hi 



x Li > 



nca , 



(n) - 0-2 



1 J 



Xi- a 



follows a chi-square distribution with n d.f. 



Note that under Hi, ( 

V ° 1 

It can be seen that a —> 0 and (3 — > 0 as n — > oo. 

Example 4. Let X follow the exponential family distributions 

f(x/6) = c (6)e Q{e)nx)h{x) . 

It is desired to test H 0 : 9 = 9 0 against H\ \ 9 — 9\ 

L(x/9 ) = [c(9)] n e Q ^ T ^ Yl h(xi ) 



L\ 



The optimum test rejects H 0 when log — > 0 

Lo 

i.e., when [Q(9 1 ) - q(9 0 )] ^ t(xi) > nlog 



n log 



i.e., when J2P( x i) — 



c(9q 
c{9 1 



[Q(9 1 ) - Q(0o)] 



c(6>i 



if g(0i)-g(0 o )>o 



c(0. 



and rejects H 0 , when ]T T(x i ) < — wy[g(6*i) - Q(0 O )] Q(^i) - <2(#o) < 0. 

c{9 1 ) 



Locally Optimum Tests 

Let the random variable X have the probability distribution P(x/9). We are 
interested in testing H 0 : 9 = 9 0 against Hi : 9 > 9 q. If IT is any critical region, 
then the power of the test as a function of 9 is 

7 (9) = P g (W) = I L(x/9)dx. 

Jw 
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We want to determine a region W for which 

7 (0) - 7(0o) = [ L(x/9)dx - [ L(x/0 ) 

J w Jw 

is a maximum. 

When a uniformly optimum region does not exist, there is not a single region 
which is best for all alternatives. We may, however, find regions which are best for 
alternatives close to the null hypothesis and hope that such regions will also do 
well for distant alternatives. We shall call such regions locally optimum regions. 
Let 7 (9) admit Taylor expansion about the point 9 = 9q. Then 

7 (9) = 7 (^ 0 ) + (9 — 9o) r y'(9o) + $ where 5 — ■> 0 as 9 — » 9o 

7(0)-7(0o) = (0-0o)V(0o) + 5. 

If 1 9 — 0q| is small, 7 (9) — j(0o) is maximized when 7 / ($o) is maximized. 



Definition 2. A region Wq will be called a locally optimum critical region if 



(4) 



L\x/9o)dx > / L\x/9o)dx 



'Wo 



'w 



for every other critical region W. 



Lemma 2. Let W 0 be the region {x : L'(x/9 0 ) > L(x/9 0 )}. Then W 0 is locally 
optimum. 

Proof. Let W 0 be the region such that inside it L'(x/9 0 ) > L(x/9 0 ) and outside 
if L\x/9q) < L(x/ 2 0 ). Let W be any other region. 



L'(x/9 0 )dx— L'(x/9o)dx 



'w 0 



<w 



'w 0 rw c 



L\x/9 0 )dx — 
L'(x/9 0 )dx + 



’w 0 nw c 



> 



L(x/9 0 )dx 



L'(x/9o)dx 

v 

L'(x/9o)dx 

v 

L(x/9o)dx 



>w 0 nw c 



since L\x/9 0 )dx > L(x/9 0 ) inside both the regions of the integrals. 
> 0, since L(x/9 0 ) > 0 in all the regions. 

Hence [ L\x/9 0 )dx > [ L'(x/9o)dx for every other region W. 



'Wo 



'w 



To prove (*) we have / L(x/9 0 )dx + / L(x/9 0 )dx = 1 for every region R. 

Jr J r c 

Differentiating, we have 

[ L\x/9 0 )dx + [ L'{x/9o)dx = 0 



' R 



I R c 



L'(x/9 0 )dx = — L\x/9o)dx = 0. 



' R 



iRc 
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In (*), take R c = Wq fl W and the relation is proved. 

Similarly, if the alternatives are Hi : 9 < 9 0 , the locally optimum critical 
region is 

{x : L'(x/6 0 ) < L(x/6 0 )}. 



Example 5. Consider N(9,a 2 ) distribution, a 2 known. It is desired to test 
H 0 : 9 = 9 0 against H\ : 9 > 9 0 



L(x/9 ) = 



K aj2ir 
log L(x/9) = n log 



-EUi-fl) 2 

e 2^2 



E(*i - 



aj2nj 2cr 2 

L'{x/9 ) 5 log L(x/9) J2{xi~9) n(x — 9) 

L(x/9 ) = 69 = ^ = 

L'{x/ 9 q ) _ n( x - 6> 0 ) 

a 2 



cr- 



L(x/9 0 ) 

The locally optimum test rejects H 0 , if 

n(x - 9o ) 



<7 



Now, 



a — Pi 



Ho 



U 



<7 



> 1 i.e. x >9 o H . 

n 



x > 9 q 



= P; 



Ho 



X - 9n 



n 



> a/Jn 



cr \/n 



x — 0\ 

= 1 — <3> ( o j Jn ) , since under H 0 , — -—= follows 1V(0, 1) distribution. 

o7 Jn 



1-/3 = P Hl [x>6 o + cr 2 /n\ 

= P Hl 



x -9 1 > 9i - 9 0 
cr/x/n - a / \fn yfn 



since under Hi, 



= l-$ 
x — 9i 



~(9i - 9 0 Jn) _u_ 

o Jn 



07 \ n 



follows IV (0, 1) distribution. 



Exercise. If 9 0 = 10, 9\ =11, a = 2, n = 16, then a = 0.3085, 1 — (3 — 0.9337, 
Power-Size = 0.6252. 




226 



RAJESH SINGH, JAYANT SINGH, FLORENTIN SMARANDACHE 



X — 9n 

If we reject H 0 when — —= > 1.64, then a = 0.05, 1 — (3 = 0.6406, Power-Size 

a/ y/n 

= 0.5906. Hence Power-Size of locally optimum test is greater than Power-size of 
the usual test. 

Locally Optimum Unbiased Test 

Let the random variable X follows the probability distribution P(x/9). Suppose 
it is desired to test H 0 : 9 = 9 0 against H L : 9 y 9 0 . We impose the unbiasedness 
restriction 

7(0)>7(*o), e?e 0 

and 7 (9) — j(9 0 ) is a maximum as compared to all other regions. If such a region 
does not exist, we impose the unbiasedness restriction j'(9 o) = 0 . 

Let 7 (9) admit Taylor expansion about the point 9 = 9 0 . Then 

7(0) = 7(«o) + (0 - »o)Y(»o) + h_hi! 7 »( 0 o ) + n 
where r) — > 0 as 9 — > 0. 

7(0) - 7(«o) = (0 - #o)7'(0o) + - 7 "( 0 o ) + r, 

Under the unbiasedness restriction Y(9 0 ) = 0, if 1 9 — 9 0 1 is small j(9) — r y(9 0 ) is 
maximized when y'(9 0 ) is maximized. 

Definition 3. A region IU 0 will be called a locally optimum unbiased region if 

(5) j'(9 0 )=[ L'(x/9 0 )dx = 0 

JWo 

and 

(6) y\9 0 ) = [ L"(X/9 0 )dx > [ L"(X/9 0 )dx 

JWo -'w 



for all other regions W satisfying (5). 

Lemma 3. Let W 0 be the region 

{x : L"(x/9q) > L(x/9 0 )}. 



Then W 0 is locally optimum unbiased. 
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Proof. Let W be any other region 

f L"(x/9o)dx — f L"(x/9 0 )dx 
JWo Jw 

= f L”(x/9o)dx+ f L"(x/9o)dx 
Jw 0 nw c Jw 0 nw 

by subtracting the common area of W and Wq . 

= f L”(x/9 0 )dx + f L"(x/9o)dx, 

Jw 0 nw c Jw 0 nw c 

since L"(x/9 0 ) > L[x/9 ) inside W 0 and outside W. 
> 0 since L(x/9) > 0. 



Example 6. Consider N(9,a 2 ) distribution, a 2 known. 

H 0 :9 = 9 0l H 1 = 9^9 0 



L = 



1 \ n EUi-g ) 2 



K 0\f2 7T, 

L"[xj9 ) n 2 (x — 9) 2 n 

L(x/9) a 4 cr 2 



Locally optimum unbiased test rejects H ( 



o 



n 2 (x - 9 0 ) 2 n 

if > 1 i.e., 

cr 4 a 2 



n(x — 9 0 )“ _ _ cr 



1 7 ^ 



2 cr 2 

> 1 H 

n 



Under H 0 , follows X 2 ^ distribution. 



cu 



yi) 



Testing Mean of a normal population when variance is unknown 

Consider N(9,a 2 ) distribution, a 2 known. For testing H 0 : 9 = 9 0 against 
Hi : 9 — 9i, the critical function of the optimum test is given by 



(x = 



1 if L(x/9i) > L(x/9 0 ) 
0 otherwise. 



On simplification, we get 



(x = 



x = 



i ^ + 9\ 

1 if x > — - — if ll\ > /i 0 

0 otherwise. 

1 ^ ^0 + $1 

1 if x > — - — if tii < /i 0 
0 otherwise. 
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Consider the case when cr 2 is unknown. 

For this case we propose a test which rejects H$ when 

L(x/9 1 ) 



L(x/9 0 



> 1 



where L{xj9f), (?’ = 0,1), is the maximum of the likelihood under Hi obtained 
from L(x/9i) by replacing cr 2 by its maximum likelihood estimate 

of = - J2(xj -9i) 2 ; i = 0,1. 

3 = 1 

Let 4> p (x) denote the critical function of the proposed test, then 

1 if L(x/9i) > L(x/9 0 ) 



Ud = 



On simplification, we get 



x = 



x = 



0 otherwise 



^ 9 0 + 6>i 

1 U X > if ylil > flQ 

0 otherwise. 

1 . f _ ^ 9 q + 6>i 

1 if x > — - — if iii < /i 0 

0 otherwise. 



Thus, the proposed test 4> p (x) is equivalent to 4> m (x ), which is the optimum test 
that minimizes the sum of two error probabilities (a + (5). Thus, we see that 
one gets the same test which minimizes the sum of the two error probabilities 
irrespective of whether a 2 is known or unknown. 

Acknowledgement. Authors are thankful to Prof. Jokhan Singh for his gui- 
dance in writing this article. 
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